The minimum description length principle for pattern mining: a survey

نویسندگان

چکیده

Abstract Mining patterns is a core task in data analysis and, beyond issues of efficient enumeration, the selection constitutes major challenge. The Minimum Description Length (MDL) principle, model method grounded information theory, has been applied to pattern mining with aim obtain compact high-quality sets patterns. After giving an outline relevant concepts from theory and coding, we review MDL-based methods for different kinds various types data. Finally, open discussion on some regarding these methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Minimum Description Length Principle

The minimum description length (MDL) principle states that one should prefer the model that yields the shortest description of the data when the complexity of the model itself is also accounted for. MDL provides a versatile approach to statistical modeling. It is applicable to model selection and regularization. Modern versions of MDL lead to robust methods that are well suited for choosing an ...

متن کامل

A tutorial introduction to the minimum description length principle

This tutorial provides an overview of and introduction to Rissanen’s Minimum Description Length (MDL) Principle. The first chapter provides a conceptual, entirely non-technical introduction to the subject. It serves as a basis for the technical introduction given in the second chapter, in which all the ideas of the first chapter are made mathematically precise. This tutorial will appear as the ...

متن کامل

Using Evolutionary Programming and Minimum Description Length Principle for Data Mining of Bayesian Networks

We have developed a new approach (MDLEP) to learning Bayesian network structures based on the Minimum Description Length (MDL) principle and Evolutionary Programming (EP). It employs a MDL metric, which is founded on information theory, and integrates a knowledge-guided genetic operator for the optimization in the search process. In contrast, existing techniques based on genetic algorithms (GA)...

متن کامل

Optimization Framework with Minimum Description Length Principle for Probabilistic Programming

Application of the Minimum Description Length principle to optimization queries in probabilistic programming was investigated on the example of the C++ probabilistic programming library under development. It was shown that incorporation of this criterion is essential for optimization queries to behave similarly to more common queries performing sampling in accordance with posterior distribution...

متن کامل

Minimum Description Length Principle for Linear Mixed Effects Models

The minimum description length (MDL) principle originated from data compression literature and has been considered for deriving statistical model selection procedures. Most of the existing methods that use the MDL principle focus on models with independent data, particularly in the context of linear regression. This paper considers data with repeated measurements and studies the selection of fi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Data Mining and Knowledge Discovery

سال: 2022

ISSN: ['1573-756X', '1384-5810']

DOI: https://doi.org/10.1007/s10618-022-00846-z